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AMENDMENTS TO THE CLAIMS 

This listing of claims will replace all prior versions, and listings, of claims in the application: 
Listing of Claims 

1. - 11. (Canceled) 

12. (Currently Amended) A computer implemented method performed by a suitably 
programmed computer for inferring genomic sequences unique to at least one set of organisms 
other than a set of organisms under investigation, the method comprising: 

obtaining genomic data characteristic of a set of organisms under investigation; 

formatting by a computer program embodied on a computer-readable medium the 

genomic data into at least one query-length sequence, each query-length sequence 
being of a format compatible with a similarity search engine program ; 

searching , by the similarity search engine program, a selected genomic computer 
database using the query-length sequence and the similarity search engine 
program for those sequences having homology above a threshold with at least one 
set of organisms other than the set under investigation, 

the selected genomic computer database containing genomic data from a plurality 
of organisms; 

parsing results of the search for a sequence unique to the selected genomic database; 
identifying from the selected genomic database the results of the search that are not a 

unique seq uence and re-evaluating a subset of those results for unique sequences 
removing the sequences from the selected genomic database that are not unique; and 
outputting by the computer to a user an identity of those sequences unique to the selected 

genomic database. 

13. (Currently Amended) A computer-implemented system for inferring genomic sequences 
unique to a at least one set of organisms other than a set of organisms under investigation, the 
computer-implemented system comprising: 



WDCIMANAGE- 1216302vl-14682.105607 



Page 2 of 7 



SAIC0086 PATENT 
Serial No. 10/762,763 

a computer program embodied on a computer-readable physical medium , the computer 
program comprising a genomic data interface module, a formatting module, a 
search interface module, a search results parsing module, and a removal module ; 

a the genomic data interface module , stored on the medium and operable to couple 

couples to a source of genomic data to receive genomic data characteristic of a set 
of organisms under investigation; 

a the formatting module , stored on the medium and operable to format formats received 
genomic data into at least one query-length sequence, each query-length sequence 
being of a format compatible with a similarity search engine; 

a the search interface module , stored on the medium and operable to interface interfaces 
with the similarity search engine to submit the query-length sequence to a 
selected genomic database containing genomic data from a substantial plurality of 
organisms; and 

a the search results parsing module , stored on the medium and operable to parse parses 
results of the search for those sequences having homology above a threshold with 
at least one set of organisms other than the set under investigation and otherwise 
unique within the selected genomic database, and to output outputs to a user an 
identity of those sequences having homology above a threshold with at least one 
set of organisms other than the set under investigation and otherwise unique 
within the selected genomic database ; and 

the removal module removes the sequences from the selected genomic database that are 
not unique; 

wherein a subset of the results of the search that are not a unique sequence are re- 
evaluated by the search interface module . 

14. (Currently Amended) A computer program, comprising a computer-readable medium having 
a computer readable program code embodied therein, the computer readable program code 
adapted to be executed to implement a computer - implemented method for inferring genomic 
sequences unique to a first set of organisms, the method comprising: 
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providing a system, wherein the system comprises distinct software modules, and 

wherein the distinct software modules comprise a formatting module, a similarity 
search engine module, and a parsing module; 

obtaining genomic data characteristic of a second set of organisms; 

formatting by a computer the second set genomic data into at least one query-length 
sequence, each query-length sequence being of a format compatible with a the 
similarity search engine module, wherein the formatting is performed by the 
formatting module ; 

searching by the computer similarly search engine module a selected genomic database 
using the formatted query-length sequence and the similarity search engine , the 
selected genomic database containing genomic data from a plurality of organisms 
including the first set of organisms; 

parsing results of the search for those sequences from the similarity search engine 
module , other than sequences of the second set, having homology above a 
threshold with the second set and otherwise unique within the selected genomic 
database , wherein the parsing is performed by the parsing module; 

identifying from the selected genomic database the results of the search that arc not a 

unique sequence and re-evaluating a subset of those results for unique sequences; 

removing the sequences from the selected genomic database that are not unique ; and 

outputting by the computer to a user an identity of those sequences having homology 
above a threshold with the second set and otherwise unique within the selected 
genomic database as genomic sequences unique to the first set. 

15. (Canceled) 
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